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Monomeric Protein of the TGF-B Family 

Description 

The present invention concerns a biologically active protein from the TGF-B 
superfamily, wherein this protein remains in monomeric form due to 
substitution or deletion of a cysteine which is reponsible for the dimerization 
in the wild-type protein. Further the invention concerns a nucleic acid, 
which codes for a protein according to the invention, an expression vector 
containing such nucleic acid and a host cell, containing a corresponding 
nucleic acid or an expression vector, said nucleic acid being suitable for the 
expression of the protein. The invention also concerns a pharmaceutical 
composition containing the protein according to the invention or a nucleic 
acid coding therefor. The use of the pharmaceutical composition according 
to the invention concerns the prevention or treatment of all conditions 
which can also be treated with the dimeric form of the corresponding 
protein. 


Many growth factors from the TGF-/? superfamily (Kingsley, Genes and 
Development 8, 1 33-1 46 (1 994) as well as the references cited therein) are 
relevant for a wide range of medical treatment methods and applications 
which in particular concern promotion of cell proliferation and tissue 
formation, including wound healing and tissue reproduction. Such growth 
factors in particular comprise members of the TGF-/? (transforming growth 
factor, cf . e.g. Roberts and Sporn, Handbook of Experimental Pharmacology 
95 (1990), page 419-472, editors: Sporn and Roberts), the DVR-group 
(Hotten et al., Biochem. Biophys. Res. Comm. 206 (1995), page 608-613 
and further literature cited therein) including BMPs (bone morphogenetic 
protein, cf . e.g. Rosen and Thies, Growth Factors in Perinatal Development 
(1993), page 39-58, editors: Tsang, Lemons and Balistreri) and GDFs 
(growth differentiation factors), the inhibin/activin (cf. e.g. Vale et al. # The 
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Physiology of Reproduction, second edition (1994), page 1861-1878, 
editors: Knobil and Neill) and the GDNF protein family (Rosenthal, Neuron 
22 (1999), page 201-203; Airaksinen et al. Mol Cell Neurosci 13 (1999), 
page 31 3-325). Although the members of the TGF-B superfamily show high 
amino acid homologies in the mature part of the protein, in particular 7 
conserved cysteines, they show considerable variations in their exact 
functions. Often individual growth factors of these families exhibit a 
plurality of functions at the same time, so that their application is of interest 
in various medical indications. Some of these multifunctional proteins also 
have survival promoting effects on neurons in addition to functions such as 
e.g. regulation of the proliferation and differention in many cell types 
(Roberts and Sporn, supra; Sakurai etal., J. Biol. Chem. 269 (1994), page 
141 18-14122). Thus e.g. trophic effects on embryonic motoric and sensory 
neurons were demonstrated for TGF-jff in vitro (Martinou et al., Devi. Brain 
Res. 52, page 1 75-1 81 (1 990) and Chalazonitis et al.. Dev. Biol. 1 52, page 
121-132 (1992)). In addition, effects promoting survival are shown for 
dopaminergic neurons of the mid-brain for the proteins TGF-/M, -2, -3, 
activin A and GDNF (glial cell line-derived neurotrophic factor), a protein 
which has structural similarities to TGF-0 superfamily members, these 
effects being not mediated via astrocytes (Krieglstein et al., EMBO J. 14, 
page 736-742 (1995)). 

Interesting members of the TGF-B superfamily or active variants thereof 
comprise the TGF-Bproteins like TGF-B 1 , TGF-S2, TGF-B3, TGF-B4, TGF-B5 
(U.S. 5,284,763; EP 0376785; U.S. 4.886,747; DNA 7 (1988), page 1-8), 
EMBO J. 7 (1988), page 3737-3743), Mol. Endo. 2 (1988), page 1186- 
1195). J. Biol. Chem. 265 (1990), page 1089-1093), OP1, OP2 and OP3 
proteins (U.S. 5,011,691, U.S. 5,652,337, WO 91/05802) as well as 
BMP2, BMP3, BMP4 (WO 88/00205, U.S. 5,013,649 and WO 89/10409, 
Science 242 (1988), page 1528-1534), BMP5, BMP6 and BMP-7 (OP1) 
(Proc. Natl. Acad. Sci. 87 (1990), page 9841-9847, WO 90/1 1366), BMP8 
(OP2) (WO 91/18098), BMP9 (WO 93/00432), BMP 10 (WO 94/26893), 
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BMP11 (WO 94/26892), BMP12(WO 95/16035), BMP13 (W095/16035), 
BMP 15 (WO 96/36710), BMP 16 (WO 98/12322), BMP3b (Biochem. 
Biophys. Res. Comm.. 219 (1996), page 656-662), GDF1 (WO 92/00382 
and Proc. Natl. Acad. Sci. 88 (1991), page 4250-4254), GDF8 (WO 
94/21681). GDF10 (W095/10539), GDF1 1 (WO 96/01845). GDF5 
(CDMP1, MP52) (WO 95/04819; WO96/01316; WO 94/15949, WO 
96/14335 and WO 93/16099 and Nature 368 (1994), page 639-643), 
GDF6 (CDMP2, BMP1 3) (WO 95/01 801 , WO 96/1 4335 and W095/1 6035), 
GDF7 (CDMP3, BMP12) (WO 95/01802 and WO 95/10635), GDF14 (WO 
97/36926), GFD15 (WO 99/06445), GDF16 (WO 99/06556), 60A 
(Proc.Natl. Acad. Sci. 88 (1991), page 9214-9218), DPP (Nature 325 
(1987), page 81-84), Vgr-1 (Proc. Nati. Acad. Sci. 86 (1989), page 4554- 
4558) Vg-1, (Cell 51 (1987). page 861-867), dorsalin (Cell 73 (1993), page 
687-702), MIS (Cell 45 (1986), page 685-698), pCL13 (WO 97/00958), 
BIP (WO 94/01557), inhibin a, activin BA and activin BB (EP 0222491), 
activin BC (MP121) (WO 96/01316), activin BE and GDF12 (WO 96/02559 
and WO 98/22492), activin BD (Biochem. Biophys. Res. Comm. 210 
(1 995), page 581-588), GDNF (Science 260 (1993), page 1 130-1 1 32, WO 
93/061 16), Neurturin (Nature 384 (199 6), page 467-4/0), Persephin 
(Neuron 20 (1998), page 245-253, WO 97/33911), Artemin (Neuron 21 
(1998), page 1291-1302). Mic-1 (Proc. Natl. Acad. Sci USA 94 (1997), 
page 11514-11519), Univin (Dev. Biol. 1 66 (1 994), page 1 49-1 58), ADMP 
(Development 121 (1995), page 4293-4301). Nodal (Nature 361 (1993), 
page 543-547), Screw (Genes Dev. 8 (1994), page 2588-2601). Other 
useful proteins include biologically active biosynthetic constructs including 
biosynthetic proteins designed using sequences from two or more known 
morphogenetic proteins. Examples of biosynthetic constructs are disclosed 
in U.S. 5,011.691 (e.g. COP-1, COP-3, COP-4, COP-5, COP-7 and COP- 
16). The disclosure of the cited publications including patents or patent 
applications are incorporated herein by reference. 
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The occurence of proteins of the TGF-/? superfamily in various tissuous 
stages and development stages corresponds with differences with regard 
to their exact functions as well as target sites, life span, requirements for 
auxiliary factors, necessary cellular physiological environment and/or 
resistance to degradation. 

The proteins of the TGF-£ superfamily exist as homodimers or heterodimers 
having a single disulfide bond. This disulfide bond is mediated by a specific 
and in most of the proteins conserved cysteine residue of the respective 
monomers. Up to now it was considered as indispensible for the biological 
activity that the protein is present in its dimeric form. Several publications 
indicated that biological activity can only be obtained for dimeric proteins 
and it was speculated that this dimer formation is important for further 
polymer formation of two or more dimers to achieve intercellular signal 
transmission by simultaneous binding to type I and type II receptors for the 
TGF-/? superfamily proteins on cells. It was assumed that only this 
simultaneous binding to both kinds of receptors would allow for effective 
intercellular signal transmission for the benefit of the patient {Bone, volume 
19 (1996), page 569-574)! 

A disadvantage of the use of these proteins as medicaments and their 
production is, that they are not readily obtainable in biologically active and 
sufficiently pure form by recombinant expression in prokaryots without 
intensive renaturation procedures. 

Thus it was the object of the present invention to provide a simple and 
inexpensive possibility to reproducibly produce proteins exhibiting high 
biological activity, wherein this biological activity should essentially 
correspond to that of the dimers of the proteins of said families. 

This object is solved according to the invention by a protein selected from 
the members of the TGF-S protein superfamily, such protein being 
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necessarily monomeric due to substitution or deletion of a cysteine which 
is responsible for dimeric formation. 


Surprisingly it has been found that the substitution or deletion of the 
cysteine, which normally effects the dimerization in the proteins, results 
upon expression and correct folding (proper formation of the intramolecular 
disulfide bridges) in a monomeric protein that retains the biological activity 
of the dimeric form. Even more surprisingly, it was found that at least some 
of the monomeric proteins show a higher activity, based on the weight of 
protein, than their respective dimeric forms. Apart from this improved 
biological activity an important advantage for the proteins according to the 
invention is that they can be expressed in a large amount in prokaryotic 
hosts and upon simple refolding of the monomers they are obtained in high 
purity and very high yield without the need to separate dimerized from non- 
dimerized (monomeric) protein. The findings of the present invention are 
very surprising since, as already mentioned above, it was common 
understanding that only a dimer of the morphogenetic proteins has 
biological activity. Despite this understanding the proteins according to the 
invention show an up to two - fo l d hi g he. activity than that of the d.mer on 


the basis of protein weight. The smaller size of the proteins of the 
invention, while maintaining the biological activity, can also be considered 
as advantageous, e.g. for applications concerning the brain since the 
monomeric protein can much easier pass the blood-brain-barrier than the 
dimeric form. 


The proteins according to the invention encompass all proteins of the 
mentioned protein families that are normally present in dimeric form. Also 
parts of such proteins that retain substantial activity or fusion proteins or 
precursor forms of proteins shall be considered as encompassed by the 
present invention as well as biologically active naturally occurring or 
biosynthetic variants of TGF-B superfamily proteins, as long as they show 
at least considerable biological activity. 
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ln a preferred embodiment of the present invention the monomeric protein 
is a mature protein or a biologically active part or variant thereof. The term 
"biologically active part or variant thereof" is meant to define either 
fragments retaining activity, precursor proteins that are e.g. cleaved at the 
site of activity to the mature form or show biological activity themselves, 
or also variants that still maintain essentially the biological activity of the 
wild-type protein. Such variants preferably contain conservative amino acid 
substitutions, but especially at the N-terminal part of the mature proteins 
even considerable deletions or substitutions do not lead to a considerable 
loss of biological activity. It is well within the skill of the man in the art to 
determine whether a certain protein shows the required biological activity. 
Proteins showing at least 70% and preferably at least 80% homology to the 
mature wild-type proteins of the above referenced protein families should 
be understood as encompassed by the present invention, as long as they 
contain the deletion or substitution of a cysteine, as required for the 
proteins according the invention, and therefore do not form dimers. 

It is especially preferred that proteins according to the invention contain at 
least the 7 cysteine region characteristic for the TGF-0 protein superf amily. 

This specific 7 cysteine region is considered to be the most important part 
of the proteins in view of the biological activity. Therefore proteins retaining 
this critical region are preferred proteins according to the invention. It is 
disclosed in the state of the art which cysteine is responsible in a certain 
protein family or protein for dimer formation (see for example: Schlunegger 
& Grutter ( 1 992) Nature 358, 430-434; Daopin et aL, ( 1 992) Science 257, 
369-373 and Griffith et at., Proc. Natl. Acad. Sci. 93 (1996), page 878- 
883). This cysteine has to be deleted or substituted by another amino acid 
to form a protein according to the invention. 

The 7 cysteine region is known for many proteins of the TGF-/? protein 
superfamily. In this region the respective location of the cysteine residues 
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to each other is important and is only allowed to vary slightly in order not 
to lose the biological activity. Consensus sequences for such proteins are 
known in the state of the art and all proteins complying with such 
consensus sequences are considered to be encompassed by the present 
invention. 

In an especially preferred embodiment of the present invention the protein 
contains a consensus sequence according to the following sequence 

C (Y)^ C Y Y Y C (Y) 26 , 35 X C (YW CYC (Formula I). 

wherein C denotes cysteine. Y denotes any amino acid including cysteine 
and X denotes any amino acid except cysteine. 

More preferably the protein according to the invention contains a consensus 
sequence according to the following sequence 

C (Y) 28 C Y Y Y C (Y)^ X C (Y} 31 CYC (Formula II), 
wherein C, X and Y have the same meaning as defined above. 


Even more preferably the protein according to the invention contains a 
consensus sequence according to the following sequence 

C (X) 28 C X X X C (X) 31 .3 3 C (X) 31 C X C (Formula III), 
wherein C and X have the same meaning as defined above. 

In these consensus sequences especially preferred distances between the 
respective cysteine residues are contained, wherein also already the dimer 
forming cysteine is substituted by another amino acid. As with all proteins 
of said protein superfamily the location of and distance between the 
cysteines is more important than the identity of the other amino acids 
contained in this region. Therefore, the consensus sequence shows the 
respective location of the cysteines, but does not show the identity of the 
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other amino acids, since these other amino acids are widely variable in the 
proteins of the TGF-£ protein superfamily. 


In a preferred embodiment of the present invention the monomeric protein 
according to the invention is a morphogenetic protein. 

Most of the members of the TGF-B protein superfamily are morphogenetic 
proteins that are useful for treatments where regulation of differentiation 
and proliferation of cells or progenitor cells is of interest. This can result in 
replacement of damaged and/or diseased tissue like for example skeletal 
(bone, cartilage) tissue, connective tissue, periodontal or dental tissue, 
neural tissue, tissue of the sensory system, liver, pancreas, cardiac, blood 
vessel and renal tissue, uterine or thyroid tissue etc. Morphogenetic 
proteins are often useful for the treatment of ulcerative or inflammatory 
tissue damage and wound healing of any kind such as enhanced healing of 
ulcers, burns, injuries or skin grafts. Especially preferred proteins according 
to the invention belong to the TGF-& BMP. GDF, activin or GDNF families. 
Several BMP proteins which were originally discovered by their ability to 
induce bone formation, have been described, as also indicated above. 
Meanwhile, several additional functions have been found as it is also true 
for members of the GDFs. These proteins show a very broad field of 
applications and especially are in addition to their bone and cartilage growth 
promoting activity (see for example: WO 88/00205, WO 90/11366, WO 
91/05802) useful in periodontal disease, for inhibiting periodontal and tooth 
tissue loss, for sealing tooth cavities, for enhancing integration of a tooth 
in a tooth socket {see for example: WO 96/26737, WO 94/06399, WO 
95/24210), for connective tissue such as tendon or ligament (see for 
example: WO 95/1 6035), for improving survival of neural cells, for inducing 
growth of neural cells and repairing neural defects, for damaged CNS tissue 
due to stroke or trauma (see for example: WO 97/34626. WO 94/03200. 
WO 95/05846). for maintaining or restoring sensory perception (see for 
example WO 98/20890, WO 98/20889), for renal failure (see for example: 
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WO 97/41 880, WO 97/41881), for liver regeneration (see for example WO 
94/06449), for regeneration of myocardium (see for example WO 
98/27995), for treatment or preservation of tissues or cells for organ or 
tissue transplantation, for integrity of gastrointestinal lining (see for example 
5 WO 94/06420), for increasing progenitor cell population as for example 
hematopoietic progenitor cells by ex vivo stimulation (see for example WO 
92/15323), etc. One preferred member of the GDF family is the protein 
MP52 which is also termed GDF-5 or CDMP-1. Applications for MP52 
reflect several of the already described applications for the BMP/GDF family. 

10 MP52 is considered to be a very effective promoter of bone and cartilage 
formation as well as connective tissue formation (see for example WO 
95/04819, Hotten et al., (1996), Growth Factors 13, 65-74, Storm et al., 
(1994) Nature 368, 639-643, Chang et al., (1994) J. Biol. Chem. 269 (45), 
28227-28234). In this connection MP52 is useful for applications 

15 concerning the joints between skeletal elements (see for example Storm & 
Kingsley (1996) Development 122, 3969-3979). One example for 
connective tissue is tendon and ligament (Wolfman et al., (1997), J. Clin. 
Invest. 100, 321-330, Aspenberg & Forslund (1999), Acta Orthop Scand 
70, 51-54, WO 95/16035). MP52 is also useful f or tooth (dental and 

20 periodontal) applications (see for example WO 95/0481 9, WO 93/1 6099, 
Morotome et al. (1998), Biochem Biophys Res Comm 244, 85-90). MP52 
is useful in wound repair of any kind. It is in addition very useful for 
promoting tissue growth in the neuronal system and survival of 
dopaminergic neurons, for example. MP52 in this connection is useful for 

25 applications in neurodegenerative diseases like e.g. Parkinson's disease and 
possibly also Alzheimer's disease for Huntington chorea tissues (see for 
example WO 97/03188, Krieglstein et aL, (1995) J. Neurosci Res. 42, 724- 
732, Sullivan et al., (1 997) Neurosci Lett 233, 73-76, Sullivan et al. ( 1 998), 
Eur. J. Neurosci 10, 3681-3688). MP52 allows to maintain nervous 

30 function or to retain nervous function in already damaged tissues. MP52 is 
therefore considered to be a generally applicable neurotrophic factor. It is 
also useful for diseases of the eye, in particular retina cornea and optic 
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nerve (see for example WO 97/031 88, You et al. (1 999,. ,nvest Opthalmol 
Vis Sci 40, 296-31 1,. The monomeric MP52 is expected to show all the 
already described activities of the dimeric form as well as some further 
described activities as described for the dimeric BMP/GDF family members 
it is expected to be for example also useful for increasing progenitor cel. 
populates and for stimulating differentiation of progenitor cells ejLyivo. 
Progenitor cells can be cells which take part in the cartilage formation 
process or hematopoietic progenitor cells. It is also useful for damaged or 
diseased tissue where a stimulation of angiogenesis is advantageous (see 
for example: Yamashita et al. (1997), Exp Cell Res 235, 218-226). 

An especially preferred protein according to the invention therefore is 
protein MP52 or a biologically active part or variant thereof. Like in the 
already above mentioned definition of these terms MP52 can e.g. be used 
in its mature form, however, it can also be used as a fragment thereof at 
least containing the 7 cysteine region or also in a precursory form 
Deviations at the N-terminal part of mature MP52 do not affect its activity 
to a considerable degree. Therefore, substitutions, deletions or additions on 
the N-terminal part of the proteins are still within the scope of the present 
invention. It might be useful to add a peptide to the N-terminal part of the 
protein, e.g. for purification reasons. It might not be necessary to cleave off 
this added peptide after expression and purification of the protein 
Additional peptides at the N- or C-terminal part of the protein may also 
serve for the targeting of the protein to special tissues such as nerve or 
bone tissue or for the penetration of the blood/brain barrier. Generally, also 
fus.on proteins of a monomeric protein according to the invention and 
another peptide or group are considered within the scope of the present 
-nvention, wherein these other peptides or groups are directing the 
localization of the fusion protein, e.g. because of an affinity to a certain 
t.ssue type etc. Examples for such fusion proteins are described in WO 
97/23612. The protein containing such addition will retain its biological 
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activity at least as long as such addition does not impair the formation of 
the biologically active conformation of the protein. 

In an especially preferred embodiment of the present invention the proteins 
s comprises the amino acid sequence according to SEQ.ID.NO.1 (DNA and 
protein sequence) and SEQ.ID.No.2 (protein sequence, only), respectively. 
SEQ.ID.NO.2 shows the complete protein sequence of the prepro protein 
of human MP52, as already disclosed in WO 95/04819. The start of the 
mature protein lies preferably in the area of amino acids 352-400, especially 
o preferred at amino acids 381 or 382. Therefore, the mature protein 
comprises amino acids 381-501 or 382-501 . The first alanine of the mature 
protein can be deleted and the mature protein then preferably comprises 
amino acids 383-501. The cysteine at position 465 that is present in the 
already described dimeric MP52 protein is according to the invention either 
deleted or substituted by another amino acid. This deletion or substitution 
is represented by Xaa at the respective position in SEQ.ID.Nos.1 and 2. 

The activin/inhibin family proteins are of interest for applications related to 
contraception, fertility and preg na ncy ( co p for n r nm pl n W » ' J47 1 'JA ' jj, U.C. 
5,102,868). They are also of interest for applications like repair or 
prevention of diseases of the nervous system, they can be used in the 
repair of organ tissue such as liver and even in bone and cartilage, too. In 
this connection MP121 (activin BC) is especially useful in applications for 
growth or regeneration of damaged and/or diseased tissue, especially the 
liver tissue, neural tissue, skeletal tissue (see for example WO 96/01316, 
WO 98/22492 and WO 97/03188). MP121 is known to be predominantly 
expressed in the liver whereby the mRNA is markedly reduced after partial 
hepatectomy. MP121 is expected to regulate the liver mass (Zhang et al.. 
Endocrine Journal 44 (1997), page 759-764). The monomeric MP121 
shows all the already described activities of the dimeric form as well as 
some further described activities as described for the dimeric TGF-S 
superfamily members. It is for example also expected to be useful in 
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treatment of ulceration (for example stomach ulceration) and useful for 
integrity of gastrointestinal lining and for stimulating differentiation of 
progenitor cells ex vivo, treatment or preservation of mammalian tissue or 
cells, e.g. for organ or tissue transplantation. 

A further preferred protein according to the invention therefore is MP121, 
a member of the activin/inhibin protein family. Also for this protein a 
biologically active part or variant thereof is encompassed by the present 
invention according to the above defined rules. An especially preferred 
embodiment is shown in SEQ.ID.NO.3 (DNA and protein sequence) and 
SEQ.ID.N0.4 (protein sequence, only) respectively. SEQ.ID.NO.4 shows the 
complete amino acid sequence of the prepro protein of human MP1 21 , that 
has already been disclosed in WO 96/01316. The start of the mature 
protein lies preferably between amino acids 217 and 247, most preferred 
at amino acid 237. A preferred mature protein therefore comprises the 
mature part of the protein starting at amino acid 237 and ending at amino 
acid 352. However, also the precursor protein comprising the whole shown 
amino acid sequence is encompassed by the present invention. The cysteine 
at position 316 is according to the invention either deleted or substituted 
by another amino acid, being represented by Xaa in SEQ.ID.Nos.3 and 4. 

The amino acid by which the cysteine residue effecting the dimerization is 
substitued can be selected by any amino acid that does not impair the 
formation of a biologically active conformation. The amino acid is preferably 
selected from the group of alanine, serine, threonine, leucine, isoleucine, 
glycine and valine. 


The proteins according to the invention are in summary characterized by the 
absence of the cysteine residue in the amino acid sequence responsible for 
the dimer formation. This absence can be effected by substitution of this 
cysteine by another amino acid or by deletion. In case of deletion, however, 
it must be assured for the protein that the formation of the biologically 
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active conformation is not hindered. The same is true for the selection of 
the substitution amino acid, wherein it is preferred to use an amino acid 
which has a form similar to cysteine. 

5 The monomeric proteins according to the invention can be easily produced, 
in particular by expression in prpkaryots and renaturation according to 
known methods. It is advantageous that the protein can be obtained in 
exceedingly biologically active form. The proteins exhibit in monomeric form 
about the same activity as the dimer so that based on the amount of active 
io substance only the same amount of the monomeric protein has to be used 
in order to obtain the same positive biological effects. 


A further subject matter of the present invention is a nucleic acid encoding 
a protein according to the invention. It is obvious that the nucleic acid has 
15 to have such a sequence that a deletion or substitution of the cysteine 
responsible for the dimer formation is achieved. The nucleic acid can be a 
naturally occurring nucleic acid, but also a recombinantly produced or 
processed nucleic acid. The nucleic acid can be both a DNA sequence and 

an RNA sequence, as long as the protein according to th<» invpntinn ran ho 

20 obtained from this nucleic acid upon expression in a suitable system. 

In a preferred embodiment of the invention the nucleic acid is a DNA 
sequence. This DNA sequence in an especially preferred embodiment of the 
invention comprises a sequence as shown in SEQ.ID.NO.1 and 

25 SEQ.ID.NO.3, respectively, or parts thereof. SEQ.ID.NO.1 shows a nucleic 
acid encoding MP52, wherein the codon for the cysteine responsible for the 
dimer formation is replaced by another codon which does not encode 
cysteine or deleted. This substitution or deletion is shown as "nnn" in the 
sequence protocols. SEQ.ID.NO.3 shows a nucleic acid encoding MP121, 

30 wherein also the codon for the cysteine effecting the dimer formation is 
replaced by a respective different codon or deleted. Instead of the complete 
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sequences of SEQ.ID.NOs.1 or 3 also parts can be used that encode the 
mature proteins or fragments also described above. 

It is preferred in the framework of the present invention that the nucleic 
s acid apart from the coding sequences also contains expression control 
sequences. Such expression control sequences are known to the man 
skilled in the art and serve to control the expression of the encoded protein 
in a host cell. The host cell does not have to be an isolated cell, moreover, 
the nucleic acid can be expressed in the patient in vivo in the target tissue.' 
10 This can be done by inserting the nucleic acid into the cell genome, 
however, it is also possible to transform host cells with expression vectors 
containing a nucleic acid according to the invention. Such expression 
vectors are a further subject matter of the present invention, wherein the 
nucleic acid is inserted in a suitable vector system, the vector system being 
s selected according to the desired expression of the protein. The vector 
system can be a eukaryotic vector system, but - in the framework of the 
present invention - it is preferably a prokaryotic vector system, with which 
the proteins can be produced in prokaryotic host cells in a particularly easy 
and pure manner. In addition, the expression vector can be a viral vector. 


) 


Also host cells in turn are a further subject matter of the present invention. 
The host cells are characterized in that they contain a nucleic acid according 
to the invention or an expression vector according to the invention and that 
they are able to use the information present in the nucleic acids and in the 
expression vector, respectively, for the expression of a monomeric protein 
according to the invention. 


Although in the framework of the present invention also eukaryotic host 
cells are suitable for the production of the protein, it is, as mentioned 
already several times above, particularly advantageous that the protein 
according to the invention can be produced in prokaryotic host cells, which 
therefore represent a preferred embodiment of the present invention. 
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After such preferred expression in prokaryotic host cells the protein is 
purified and renatured according to known methods, thereby effecting 
intramolecular cystine bridge formation. 

Since, however, not only in vitro production of the monomeric protein is 
possible, but also in vivo expression of a nucleic acid according to the 
invention, a further preferred embodiment is a eukaryotic host cell, and 
especially a eukaryotic host cell containing the DNA in its genome, or as an 
expression vector. Such host cell can also be useful for application to an 
individual in need of morphogenic treatment. 

Further subject matters of the present application are pharmaceutical 
compositions comprising at least one monomeric protein according to the 
invention or at least one nucleic acid encoding for such a protein or at least 
one corresponding expression vector, or at least one eukaryotic host cell 
expressing the monomeric protein. 

The protein itself, but also a nucleic acid according to the invention, an 
expression vector nr a hn n t cn l l nn» l m nimg lHarAH tn hA art V a n taqpmif; at 
active substances in a pharmaceutical composition. Also combinations of 
monomeric proteins, with either biological activities in the same or different 
applications, can be used in preferred pharmaceutical compositions. 
Especially preferred for neuronal applications are combinations of MP52 
with other TGF-jf? superfamily proteins, both in monomeric form, like for 
example with GDNF (see WO 97/03188). Also preferred for neuronal 
applications are combinations of TGF-jS with GDNF, both in monomeric 
form. Also for applications concerning cartilage and/or bone the 
combination of several monomeric proteins might be useful, like MP52 with 
a protein of TGF-jff (see e.g. WO 92/09697) or MP52 with a cartilage 
maintenance-inducing protein such as BMP-9 (see e.g. WO 96/39170). 
When a nucleic acid or an expression vector is used, however, it has to be 
ensured that when administering to the patient there has to be an 
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environment in which the nucleic acid and the expression vector, 
respectively, can be expressed and the protein according to the invention 
can be produced in vivo at the site of action. The same applies accordingly 
to the host cell according to the invention. When using expression vectors 
or host cells it is also possible that they encode more than one monomeric 
protein of the invention to produce a combination of two or more 
monomeric proteins. 

It is advantageous to both the protein and the nucleic acid or the expression 
vector or the host cell when they are applied in and/or on a biocompatible 
matrix. The matrix material can be transplanted into the patient, e.g. 
surgically, wherein the protein either is effective on the surface of the 
matrix material or the protein or the DNA encoding the protein can be 
slowly released from the matrix material and then be effective over a long 
period of time. Additionally it is possible and advantageous to use a 
biodegradable matrix material in the pharmaceutical composition, wherein 
this material preferably dissolves during the protein induced tissue formation 
so that a protein or a nucleic acid contained therein is released and the 
newly formed tissue replaces the matrix material. 

Finally, in case of applications relating to bone formation, it is advantageous 
to use a matrix material which is itself e.g. osteogenically active. By using 
such a matrix material it becomes possible to achieve a synergistic effect 
of protein and matrix material and to effect a particularly rapid and effective 
bone formation. 

An especially preferred matrix material that can be used according to the 
invention is a matrix material as described in U.S. 5,231,169 and U.S. 
5,776,193 and especially for applications like spinal fusion. 

When using a combination of a matrix material and protein and/or nucleic 
acid and/or expression vector, it is preferable to sterilize such a combination 
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prior to its use. The matrix and the morphogeny protein can be separat ly 
stenhzed and then combined, but it is preferred to terminally sterilize the 
device consisting of matrix and morphogenetic protein. Terminal sterilization 
can be achieved by ionizing radation as already described for dimeric 
« proteins (U.S. 5,674,292) but it is also advantageous to use ethylene oxide. 

Of course this invention also comprises pharmaceutical compositions 
containing further substances like e.g. pharmacologically acceptable 
auxiliary and carrier substances. However, the protein according to the 
invention, also in case a matrix material is used, does not necessarily have 
to be used together with this matrix material, but can also be administered 
systemically, wherein it concentrates preferably in the surrounding of an 
implanted matrix material. 


is For some applications the protein according to the invention and the nucleic 
acid forming this protein, respectively or the expression vector or host cell 
can preferably be present in an injectable composition. Implants are not 
necessary or possible for every form of application of the proteins according 
to the invention. However, it is also pns s ihl* to provid o an i mp l an ta ble 


25 


30 


vessel or an implantable micropump containing for example semipermeable 
membranes in which the protein according to the invention or the nucleic 
acid generating it is contained, from which either one is slowly released 
over a prolonged period of time. The pharmaceutical composition according 
to the invention can also contain other vehicles which make it possible that 
the proteins or the nucleic acids or the expression vectors encoding these 
proteins be transported to the site of activity and released there, wherein 
e.g. liposomes or nanospheres can be used. In principle, it is also possible 
to apply host cells, like e.g. implanted embryonic cells expressing the 
proteins. Cells transfected with recombinant DNA may be encapusled prior 
to implantation. Any other practicable but herein not explicitly described 
form of application of the pharmaceutical composition according the 
invention and their corresponding manufacture are also comprised by the 
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present invention, as long as they contain a protein according to the 
invention or a nucleic acid or an expression vector coding therefor, or a host 
cell expressing it. 

5 Although the indications shall not be restricted herein and all indications 
exhibiting the dimeric form of the protein according to the invention are also 
comprised, in the following types of application for the compositions 
according to the invention are listed which are considered to be particularly 
preferred indications for the proteins of the present invention. On the one 

> hand, there is the prevention or therapy of diseases associated with bone 
and/or cartilage damage or affecting bone and/or cartilage disease, or 
generally situations, in which cartilage and/or bone formation is desirable or 
for spinal fusion, and on the other hand, there is prevention or therapy of 
damaged or diseased tissue associated with connective tissue including 
tendon and/or ligament, periodontal or dental tissue including dental 
implants, neural tissue including CNS tissue and neuropathological 
situations, tissue of the sensory system, liver, pancreas, cardiac, blood 
vessel, renal, uterine and thyroid tissue, skin, mucous membranes, 
endothelium, epithelium, for promotion or induction of nerve growth, tissue 
regeneration, angiogenesis, wound healing including ulcers, burns, injuries 
or skin grafts, induction of proliferation of progenitor cells or bone marrow 
cells, for maintenance of a state of proliferation or differentiation for 
treatment or preservation of tissue or cells for organ or tissue 
transplantation, for integrity of gastrointestinal lining, for treatment of 
disturbances in fertility, contraception or pregnancy. 

Diseases concerning sensory organs like the eye are also to be included in 
the preferred indication of the pharmaceutical composition according to the 
invention. As neuronal diseases again Parkinson's and Alzheimer's diseases 
can be mentioned as examples. 
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The pharmaceutical compositions according to the invention can be used in 
any desired way. the pharmaceutical compositions are formulated preferably 
for surgical local application, topical or systemic application. Auxiliary 
substances for the individual application form can of course be present in 
the pharmaceutical composition according to the invention. For some 
applications it can be advantageous to add some further substances to the 
pharmaceutical composition as for example Vitamin D (WO 92/21365), 
parathyroid hormone related peptide (WO 97/35607), chordin (WO 98/ 
21335), anti-fibrinolytic agent (EP 535091), anti-metabolites (WO 
95/09004), alkyl cellulose (WO 93/00050). mannitol (WO 98/33514), 
sugar, glycine, glutamic acid hydrochloride (U.S. 5,385,887), antibiotics, 
antiseptics, amino acids and/or additives which improve the solubility or 
stablility of the monomeric morphogenetic protein as for example nonionic 
detergents (e.g. Tween 80), basic amino acids, carrier proteins (e.g. serum 
albumin), full length propeptides of the TGF-y? superfamily or parts thereof. 

As can be already gathered from the description of proteins, nucleic acids 
and pharmaceutical compositions, the proteins according to the invention 
and respective nucleic acids, which provide for an expression of the 
Proteins at the site ot activity, can advantageously be applied in all areas for 
which also the dimeric forms of the proteins, as described, can be applied. 
In the framework of the present invention therefore a further subject matter 
is the use of a pharmaceutical composition according to the present 
invention for the treatment or prevention of any indications of the dimeric 
forms of the proteins according to the invention. 

Herein it is again possible to conduct surgical operations and to implant the 
pharmaceutical composition (in particular contained on a matrix material), 
an administration in liquid or otherwise suitable form via, e.g. injection or 
oral administration seems to be as suitable as a topical application for e.g. 
tissue regeneration. 
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Rg. 1A shows a two dimensional graph of the conformation of 
recombinantly produced dimeric MP52 with the deleted first alanine. In this 
figure the 7 cysteine bridges contained in a dimer are shown, wherein there 
are 3 intramolecular cystine bridges per monomer unit and 1 intermolecular 
cystine brigde connecting both monomers. Fig. 1B shows the monomeric 
protein according to the invention wherein the cysteine of the amino acid 
sequence of MP52 has been replaced by X that denotes any amino acid 
except cysteine. 


Examples 

The examples shall explain the invention more illustratively and are not 
considered to be restrictive. 

Expression of monomeric MP52 in E. coli . refolding , purification and 
possible test systems to determine the biological activity 

The mature part of human MP52 was mutated by converting Cysteine to 
Alanine at position 465 of SEQ ID NO 2. For this purpose the nucleotides 
at the position 2032 and 2033 of SEQ ID NO 1 were converted from TG to 
GC. The substitution of the nucleotides was performed by using the 
prokaryotic plasmid pBP2MP52m and a PCR method. 

The vector pBP2MP52m is a derivative of the pBR322 plasmid containing 
an ampicillin resistance gene, a T7-promoter followed by a ribosomal 
binding site, a start codon, the mature part of MP52 (coding for the amino 
acids 382-501 in SEQ ID NO. 2 with a Cysteine at position 465), stop 
codons in each reading frame and a terminator. The plasmid was deposited 
at the DSM (DSM 10029, 2nd June 1995). 

The mutation was performed by using the QuickChange™ Site-Directed 
Mutagenesis Kit with the PfuTurbo™ DNA polymerase and the Dpn I 
endonuclease from Stratagene (Catalog #200518) according to the 
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instruction manual of the manufacturer. The oligonucleotide 
CCACACCACCCACCGCCTGTGTGCCCACGC (SEQ ID NO. 5) and the 
oligonucleotide GCGTGGGCACACAGGCGGTGGGTGGTGTGG (SEQ ID NO. 
6) were purified by polyacrylamide gel electrophoresis and used as the 
s mutagenic primers. The resulting plasmid pBP2MP52 AW contained after the 
start codon the nucleotide sequence coding for the amino acids 382-501 
in SEQ ID NO. 2 with an Alanine at position 465 which was verified by 
sequencing. 

io The expression of monomeric MP52 can be induced by providing a source 
of T7 RNA polymerase. Using the bacterial strain BL21(DE3)pLysS 
(Novagen) transformed with the plasmid pBP2MP52 AlA and inducing the T7 
RNA polymerase gene according to the manufacturers instructions with 
IPTG, monomeric MP52 can be expressed in inclusion bodies which can be 
is isolated according to standard procedures. Further purification was done on 
a reversed phase column (Nucleosil 300-7C4, Machery-Nagel) with a 0 to 
50% buffer B gradient (buffer A: 0.1% TFA in water, buffer B: 90% 
acetonitrile, 0.1 % TFA) in 50 minutes (flow rate: 2 ml/min). The fractions 
containing monomeric MP52 were pooled, lyophilized and stored at -70°C. 
~ 20 MP52 was solubilized in a denaturing buffer (6M Urea, 500 mM NaCI, 10 
mM DTT, 1 mM EDTA , 20 mM Tris pH 8.3) and added for refolding in 9 
times of a 50 mM Glycine-buffer system (pH 9.8) containing CHAPS (20 
mM), NaCI (500 mM) and 3 mM GSSG and gently stirred at 4°C. After 
approximately 24 hours the sample was diluted 2.8 times with 14 mM 
25 NaH 2 P0 4 and subjected to isoelectric precipitation. After centrifugation the 
precipitate was dissolved in 0.1 % TFA and the folded monomer was 
further purified by reversed phase HPLC. For this purpose MP52 was loaded 
on a column (Aquapore Octyl 20 micron. Applied Biosystems) equilibrated 
with 25% buffer B (buffer A: 0.1% TFA in water, buffer B: 90% 
30 acetonitrile, 0.1 % TFA). Monomeric MP52 was eluted with a 25-60% 
buffer B gradient in 70 minutes (flow rate 3 ml/min). Fractions containing 
purified refolded monomeric MP52 were pooled, lyophilized, stored at-70°C 
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and can be used in biological activity studies. Useful for biological activity 
studies of monomeric MP52 in the field of cartilage and bone induction are 
for example the measurement of increasing ALP activity (Takuwa et al., 
Am. J. Physiol. 257, E797-E803, 1 989) using osteoprogenitor-Iike ROB-C26 
cells (Yamaguchi et al., Calcif. Tissue Int. 49, 221-225, 1991) as described 
in W095/04819 or ATDC5 cells (Riken Gene Bank, RCB 0565, embryonic 
cells which differentiate like cartilage cells). Useful activity tests showing 
the neurological capacities of refolded monomeric MP52 are the 
measurement of increased survival of dopaminergic neurons as described 
for example by Krieglstein et al. (J. Neuroscience Res. 42, 724-732, 1 995) 
or Sullivan et al. (Neuroscience Letters 233, 73-76, 1997) or the 
outgrowth of nerve fibers from embryonic retina are described for example 
in the WO 97/031 88. The angiogenic potential of folded monomeric MP52 
can be verified for example in an in vivo corneal micropocket model. Briefly, 
Hydron can be used as a slow release pellet adapted from D 'Amato et al 
(Proc. Natl. Acad. Sci. USA 91 (1994) 4082-4085). About 2 //g of 
monomeric MP52 can be dissolved in 20 fA of 50% acetonitrile, stabilized 
with 5 mg sucralfate (sucrose aluminum sulfate; Bukh Meditec, 
Copenhagen) and mixed with 20^1 of 1 2% (wt/vol) Hydron NCC (Interferon 
Science, New Brunswick, NJ) in ethanol. The mixture can be pipetted onto 
Teflon pegs and dried to produce a pellet. Subsequently the pellet can be 
implanted into corneal micropockets of the eye of a male New Zealand 
White rabbit. For this purpose an intrastromal tunnel, starting from a 
perpendicular linear incision in the pupillary plane and ending 2.75 mm prior 
to the limbus can be created. As a postsurgical care one single dose of 
erythromycin ointment should be given onto the surface of the cornea. 
About 10 days after implantation new capillaries which grow toward the 
implantation site should be seen. The outgrowth towards the pellet can 
reflect the chemotactic response of the endothelial cells to the released 
MP52 and can be quantified by measuring the vascularized area and the 
number of vessels at the limbus . 
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Monomeric MP52 isolated and purified from inclusion bodies but not 
subjected to the refolding process can be used as a negative control. 
Dimeric MP52 can be used as the positive control. 

Expression of monomeric MP121 in E.coli . refolding, purification and 
possible test systems to determine the biological activity 

The mature part of human MP121 was mutated by converting Cysteine to 
Alanine at position 316 of SEQ ID NO 4. For this purpose the nucleotides 
at position 1073 and 1074 of SEQ ID NO. 3 were converted from TG to 
GC. The substitution of the nucleotides was performed essentially as 
described for MP52 in the above example with differences in the plasmid 
and the mutagenic primers. The pBP4MP121m plasmid was used which 
differs from the pBP2MP52m plasmid by containing the mature part of 
MP121 (coding for the amino acids 237-352 in SEQ ID NO. 4 with a 
Cysteine at position 316) instead of the mature part of MP52 and by 
harboring a tetracycline resistance gene instead of the ampicillin resistance 
gene. As mutagenic primers the oligonucleotide 
CTGGAGGGGGCTCAGCCTGTGTACCCACGG (SEQ ID NO. 7) and the 
oligonucleotide CCGTGGGTACACAGGCTGAGCCCUC I CCAG (SbU ID NO. 
8) were used. 

The resulting plasmid PBP4MP121ALA contained after the start codon the 
nucleotide sequence coding for the amino acids 237-352 in SEQ ID NO. 4 
with an Alanine at position 316 which was verified by sequencing. 

The expression of monomeric MP121, isolation of inclusion bodies and 
purification was performed essentially as described for MP52 in the above 
example, with the exception that another E.coli strain, the HMS 1 74 (DE3) 
(Novagen), was used. 
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For biological activity studies of refolded monomelic MP121 showing the 
neurological capacities the same assays as mentioned for MP52 can be 
used. The influence of refolded monomeric MP121 on liver growth can be 
investigated for example on rat hepatocytes in primary cultures by inhibiting 
EGF induced DNA synthesis. Briefly, hepatocytes can be isolated from rat 
(Wistar) liver and cultured according to Yasuda et al. (J. Clin. Invest. Vol. 
92, 1491-1496 (1993)). The cells have to be washed prior to incubation 
with fresh serum-free medium containing 0.1 nM insulin. 0.1 % BSA and 1 
nM EGF. The solubilized refolded monomeric MP121 can be added to the 
medium at various concentrations and subsequently the hepatocytes have 
to be incubated for 72h including 0.5 ^i [ 3 H]Thymidine/ml for the last 24 
hours as described by Mead & Fausto (Proc. Natl.Acad.Sci.USA 86, 1 558- 
15562 (1989)). I 3 H]Thymidine incorporation into trichloracetic acid- 
precitable material can be measured as described by McNiel et al. {J. Cell 
Biol. 101, 372-379 (1985)). 


Stimulation of hematopoiesis can be determined for examples on bone 
marrow aspirate cultures as described in example 8 of W098/22492. 
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SEQUENCE LISTING 

<110> HyGene AG 

<120> Monomeric Protein of the TGF-beta Family 

<130> 20780pwomd 

<140> 
<141> 

<150> EP99115613 
15 <151> 1999-08-06 

<160> 8 

<170> Patentln Ver. 2.1 


20 


25 


35 


40 


<210> 1 

<211> 2703 

<212> DNA 

<213> Homo sapiens 


<220> 
<221> CDS 


<222> (640) . . (2142) 


30 <400> 1 

ccatggcctc gaaagggcag cggtgatttt tttcacataa atatatcgca cttaaatgag 60 
tttagacagc atgacatcag agagtaatta aattggtttg ggttggaatt ccgtttccaa 120 
ttcctgagtt caggtttgta aaagattttt ctgagcacct gcaggcctgt gagtgtgtgt 180 


gtgtgtgtgt gtgtgtgtgt gtgtgtgtga agtattttca ctggaaagga ttcaaaacta 240 
gggggaaaaa aaaactggag cacacaggca gcattacgcc attcttcctt cttggaaaaa 300 


5 
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tccctcagcc ttatacaagc ctccttcaag ccctcagtca gttgtgcagg agaaaggggg 360 
cggttggctt tctcctttca agaacgagtt attttcagct gctgactgga gacggtgcac 420 
gtctggatac gagagcattt ccactatggg actggataca aacacacacc cggcagactt 480 
10 caagagtctc agactgagga gaaagccttt ccttctgctg ctactgctgc tgccgctgct 540 
tttgaaagtc cactcctttc atggtttttc ctgccaaacc agaggcacct ttgctgctgc 600 

15 

cgctgttctc tttggtgtca ttcagcggct ggccagagg atg aga etc ccc aaa 654 

Met Arg Leu Pro Lys 
1 5 

20 etc etc act ttc ttg ctt tgg tac ctg get tgg ctg gac ctg gaa ttc 702 
Leu Leu Thr Phe Leu Leu Trp Tyr Leu Ala Trp Leu Asp Leu Glu Phe 
10 15 20 

ate tgc act gtg ttg ggt gee cct gac ttg ggc cag aga ccc cag ggg 750 
25 He Cys Thr Val Leu Gly Ala Pro Asp Leu Gly Gin Arg Pro Gin Gly 
25 30 35 

ace agg cca gga ttg gee aaa gca gag gee aag gag agg ccc ccc ctg 798 
Thr Arg Pro Gly Leu Ala Lys Ala Glu Ala Lys Glu Arg Pro Pro Leu 
30 40 45 50 

gec egg aac gtc ttc agg cca ggg ggt cac age tat ggt ggg ggg gec 846 
Ala Arg Asn Val Phe Arg Pro Gly Gly His Ser Tyr Gly Gly Gly Ala 
55 60 65 


35 


ace aat gee aat gee agg gca aag gga ggc acc ggg cag aca gga ggc 894 
Thr Asn Ala Asn Ala Arg Ala Lys Gly Gly Thr Gly Gin Thr Gly Gly 
70 75 80 85 


40 


ctg aca cag ccc aag aag gat gaa ccc aaa aag ctg ccc ccc aga ccg 
Leu Thr Gin Pro Lys Lys Asp Glu Pro Lys Lys Leu Pro Pro Arg Pro 
90 95 100 


942 
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ggc ggc cct gaa ccc aag cca gga cac cct ccc caa aca agg cag get 990 
Gly Gly Pro Glu Pro Lys Pro Gly His Pro Pro Gin Thr Arg Gin Ala 
105 no 115 

aca gec egg act gtg acc cca aaa gga cag ctt ccc gga ggc aag gca 1038 
Thr Ala Arg Thr Val Thr Pro Lys Gly Gin Leu Pro Gly Gly Lys Ala 
120 125 130 

ccc cca aaa gca gga tct gtc ccc age tec ttc ctg ctg aag aag gee 1086 
Pro Pro Lys Ala Gly Ser Val Pro Ser Ser Phe Leu Leu Lys Lys Ala 
135 140 145 

agg gag ccc ggg ccc cca cga gag ccc aag gag ccg ttt cgc cca ccc 1134 
Arg Glu Pro Gly Pro Pro Arg Glu Pro Lys Glu Pro Phe Arg Pro Pro 
150 155 iso 165 

ccc ate aca ccc cac gag tac atg etc teg ctg tac agg acg ctg tec 1182 
Pro He Thr Pro His Glu Tyr Met Leu Ser Leu Tyr Arg Thr Leu Ser 
170 175 iso 

gat get gac aga aag gga ggc aac age age gtg aag ttg gag get ggc 1230 
Asp Ala Asp Arg Lys Gly Gly Asn Ser Ser Val Lys Leu Glu Ala Gly 
185 190 195 

ctg gec aac acc ate acc age ttt att gac aaa ggg caa gat gac cga 1278 
Leu Ala Asn Thr He Thr Ser Phe He Asp Lys Gly Gin Asp Asp Arg 
200 205 210 


ggt ccc gtg gtc agg aag cag agg tac gtg ttt gac att agt gec ctg 1326 
Gly Pro Val Val Arg Lys Gin Arg Tyr Val Phe Asp He Ser Ala Leu 
215 220 225 

gag aag gat ggg ctg ctg ggg gee gag ctg egg ate ttg egg aag aag 1374 
Glu Lys Asp Gly Leu Leu Gly Ala Glu Leu Arg He Leu Arg Lys Lys 
230 "5 240 245 

ccc teg gac acg gee aag cca gcg gee ccc gga ggc ggg egg get gec 1422 
Pro Ser Asp Thr Ala Lys Pro Ala Ala Pro Gly Gly Gly Arg Ala Ala 
250 255 260 

cag ctg aag ctg tec age tgc ccc age ggc egg cag ccg gec tec ttg 1470 
Gin Leu Lys Leu Ser Ser Cys Pro Ser Gly Arg Gin Pro Ala Ser Leu 
265 270 275 
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ctg gat gtg cgc tec gtg cca ggc ctg gac gga tct ggc tgg gag gtg 1518 
Leu Asp Val Arg Ser Val Pro Gly Leu Asp Gly Ser Gly Trp Glu Val 
280 285 290 


ttc gac ate tgg aag etc ttc cga aac ttt aag aac teg gec cag ctg 1566 
Phe Asp He Trp Lys Leu Phe Arg Asn Phe Lys Asn Ser Ala Gin Leu 
295 300 305 


tgc ctg gag ctg gag gec tgg gaa egg ggc agg gee gtg gac etc cgt 1614 
10 Cys Leu Glu Leu Glu Ala Trp Glu Arg Gly Arg Ala Val Asp Leu Arg 
310 315 320 325 


ggc ctg ggc ttc gac cgc gee gec egg cag gtc cac gag aag gec ctg 1662 
Gly Leu Gly Phe Asp Arg Ala Ala Arg Gin Val His Glu Lys Ala Leu 
15 330 335 340 

ttc ctg gtg ttt ggc cgc acc aag aaa egg gac ctg ttc ttt aat gag 1710 
Phe Leu Val Phe Gly Arg Thr Lys Lys Arg Asp Leu Phe Phe Asn Glu 
345 350 355 

20 

att aag gec cgc tct ggc cag gac gat aag acc gtg tat gag tac ctg 1758 
He Lys Ala Arg Ser Gly Gin Asp Asp Lys Thr Val Tyr Glu Tyr Leu 
360 365 370 


25 ttc age cag egg cga aaa egg egg gec cca ctg gec act cgc cag ggc 1806 
Phe Ser Gin Arg Arg Lys Arg Arg Ala Pro Leu Ala Thr Arg Gin Gly 
375 380 385 


aag cga ccc age aag aac ctt aag get cgc tgc agt egg aag gca ctg 1854 
30 Lys Arg Pro Ser Lys Asn Leu Lys Ala Arg Cys Ser Arg Lys Ala Leu 
390 395 400 405 

cat gtc aac ttc aag gac atg ggc tgg gac gac tgg ate ate gca ccc 1902 
His Val Asn Phe Lys Asp Met Gly Trp Asp Asp Trp He He Ala Pro 
35 410 415 420 


ctt gag tac gag get ttc cac tgc gag ggg ctg tgc gag ttc cca ttg 1950 
Leu Glu Tyr Glu Ala Phe His Cys Glu Gly Leu Cys Glu Phe Pro Leu 
425 430 435 

40 

cgc tec cac ctg gag ccc acg aat cat gca gtc ate cag acc ctg atg 1998 
Arg Ser His Leu Glu Pro Thr Asn His Ala Val He Gin Thr Leu Met 
440 445 450 
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aac tec atg gac ccc gag tec aca cca cec acc nnn tgt gtg ccc acg 2046 
Asn Ser Met Asp Pro Glu Ser Thr Pro Pro Thr Xaa Cys Val Pro Thr 
455 460 465 

5 egg ctg agt ccc ate age ate etc ttc att gac tct gee aac aac gtg 2094 
Arg Leu Ser Pro lie Ser He Leu Phe lie Asp Ser Ala Asn Asn Val 
470 47 * 480 485 


10 


15 


20 


25 


30 


gtg tat aag cag tat gag gac atg gtc gtg gag teg tgt ggc tgc agg . 2142 
Val Tyr Lys Gin Tyr Glu Asp Met Val Val Glu Ser Cys Gly Cys Arg 
490 495 soo 

tagcagcact ggccctctgt cttcctgggt ggcacatccc aagagcccct tcctgcactc 2202 

ctggaatcac agaggggtca ggaagctgtg gcaggagcat ctacacagct tgggtgaaag 2262 

gggattccaa taagcttget cgctctctga gtgtgacttg ggctaaaggc ccccttttat 2322 

ccacaagttc ccctggctga ggattgctgc ccgtctgctg atgtgaccag tggcaggcac 2382 

aggtccaggg agacagactc tgaatgggac tgagtcccag gaaacagtgc tttccgatga 2442 

gactcagccc accatttctc ctcacctggg ccttctcagc ctctggactc tcctaagcac 2502 

ctctcaggag agecacaggt gccactgcct cctcaaatca catttgtgcc tggtgacttc 2562 

ctgtccctgg gacagttgag aagctgactg ggcaagagtg ggagagaaga ggagagggct 2622 


tggatagagt tgaggagtgt gaggctgtta gactgttaga tttaaatgta tattgatgag 2682 
ataaaaagca aaactgtgcc t 


2703 


<210> 2 
35 <211> 501 
<212> PRT 
<213> Homo sapiens 

<400> 2 

40 Met Arg Leu Pro Lys Leu Leu Thr Phe Leu Leu Trp Tyr Leu Ala Trp 


45 


Leu Asp Leu Glu Phe He Cys Thr Val Leu Gly Ala Pro Asp Leu Gly 
20 25 30 
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Gln Arg Pro Gin Gly Thr Arg Pro Gly Leu Ala Lys Ala Glu Ala Lys 
35 «'« 45 


Glu Arg Pro Pro Leu Ala Arg Asn Val Phe Arg Pro Gly Gly His Ser 
5 50 55 60 

Tyr Gly Gly Gly Ala Thr Asn Ala Asn Ala Arg Ala Lys Gly Gly Thr 
" 70 " 

10 Gly Gin Thr Gly Gly Leu Thr Gin Pro Lys Lys Asp Glu Pro Lys Lys 


85 90 


95 


15 


Leu Pro Pro Arg Pro Gly Gly Pro Glu Pro Lys Pro Gly His Pro Pro 
100 xos 110 


Gin Thr Arg Gin Ala Thr Ala Arg Thr Val Thr Pro Lys Gly Gin 
US 120 12S 


Leu 


Pro Gly Gly Lys Ala Pro Pro Lys Ala Gly Ser Val Pro Ser Ser Phe 
20 "0 135 140 


Leu Leu Lys Lys Ala Arg Glu Pro Gly Pro Pro Arg Glu Pro Lys Glu 

160 


145 150 1S5 


25 Pro Phe Arg Pro Pro Pro lie Thr Pro His Glu Tyr Met Leu Ser Leu 

165 170 175 

Tyr Arg Thr Leu Ser Asp Ala Asp Arg Lys Gly Gly Asn Ser Ser Val 
180 185 190 

30 

Lys Leu Glu Ala Gly Leu Ala Asn Thr lie Thr Ser Phe lie Asp Lys 
195 200 JOS 

Sly Gin Asp Asp Arg Gly Pro Val Val Arg Lys Gin Arg Tyr Val Phe 
35 210 215 220 

Asp lie Ser Ala Leu Glu Lys Asp Gly Leu Leu Gly Ala Glu Leu Arg 
225 230 235 240 

40 He Leu Arg Lys Lys Pro Ser Asp Thr Ala Lys Pro Ala Ala Pro Gly 

2 « 250 255 

Gly Gly Arg Ala Ala Gin Leu Lys Leu Ser Ser Cys Pro Ser Gly Arg 
260 26S 270 

45 
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Gin Pro Ala Ser Leu Leu Asp Val Arg Ser Val Pro Qly Leu Asp Gly 
275 280 285 

Ser Gly Trp Glu Val Phe Asp He Trp Lys Leu Phe Arg Asn Phe Lys 
5 290 295 300 

Asn Ser Ala Gin Leu Cys Leu Glu Leu Glu Ala Trp Glu Arg Gly Arg 
305 310 315 320 

10 Ala Val Asp Leu Arg Gly Leu Gly Phe Asp Arg Ala Ala Arg Gin Val 

325 330 335 

His Glu Lys Ala Leu Phe Leu Val Phe Gly Arg Thr Lys Lys Arg Asp 
340 345 350 

15 

Leu Phe Phe Asn Glu He Lys Ala Arg Ser Gly Gin Asp Asp Lys Thr 
355 360 365 


Val Tyr Glu Tyr Leu Phe Ser Gin Arg Arg Lys Arg Arg Ala Pro Leu 
20 370 375 380 

Ala Thr Arg Gin Gly Lys Arg Pro Ser Lys Asn Leu Lys Ala Arg Cys 
385 390 395 400 

25 Ser Arg Lys Ala Leu His Val Asn Phe Lys Asp Met Gly trp Asp Asp 
4^5 41 c 


30 


Trp He He Ala Pro Leu Glu Tyr Glu Ala Phe His Cys Glu Gly Leu 

420 425 430 

Cys Glu Phe Pro Leu Arg Ser His Leu Glu Pro Thr Asn His Ala Val 
435 440 445 


He Gin Thr Leu Met Asn Ser Met Asp Pro Glu Ser Thr Pro Pro Thr 
35 450 455 460 

Xaa Cys Val Pro Thr Arg Leu Ser Pro He Ser He Leu Phe He Asp 
465 470 475 480 

40 Ser Ala Asn Asn Val Val Tyr Lys Gin Tyr Glu Asp Met Val Val Glu 

485 490 495 


Ser Cys Gly Cys Arg 
500 

45 
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10 


15 


20 


25 


35 


40 


<210> 3 

<211> 2272 

<212> DNA 

<213> Homo sapiens 

<220> 
<221> CDS 

<222> (128) .. (X183) 
<400> 3 

caaggagcca tgccagctgg acacacactt cttccagggc ctctggcagc caggacagag 60 

ttgagaccac agctgttgag accctgagcc ctgagtctgt attgctcaag aagggccttc 120 

cccagca atg acc tec tea ttg ctt ctg gee ttt etc etc ctg get cca 169 
Met Thr Ser Ser Leu Leu Leu Ala Phe Leu Leu Leu Ala Pro 
IS io 

acc aca gtg gec act ccc aga get ggc ggt cag tgt cca gca tgt ggg 217 
Thr Thr Val Ala Thr Pro Arg Ala Gly Gly Gin Cys Pro Ala Cys Gly 
15 20 25 30 

ggg ccc acc ttg gaa ctg gag age cag egg gag ctg ctt ctt gat ctg 265 
Gly Pro Thr Leu Glu Leu Glu Ser Gin Arg Glu Leu Leu Leu Asp Leu 
35 40 45 


gec aag aga age ate ttg gac aag ctg cac etc acc cag cgc cca aca 313 
Ala Lys Arg Ser He Leu Asp Lys Leu His Leu Thr Gin Arg Pro Thr 
30 50 55 60 

ctg aac cgc cct gtg tec aga get get ttg agg act gca ctg cag cac 361 
Leu Asn Arg Pro Val Ser Arg Ala Ala Leu Arg Thr Ala Leu Gin His 
65 70 75 


etc cac ggg gtc cca cag ggg gca ctt eta gag gac aac agg gaa cag 409 
Leu His Gly Val Pro Gin Gly Ala Leu Leu Glu Asp Asn Arg Glu Gin 
80 as 90 

gaa tgt gaa ate ate age ttt get gag aca ggc etc tec acc ate aac 457 
Glu Cys Glu He He Ser Phe Ala Glu Thr Gly Leu Ser Thr He Asn 
95 100 105 no 
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10 


20 


25 


30 


40 


cag act cgt ctt gat ttt cac ttc tec tct gat aga act get ggt gac 505 
Gin Thr Arg Leu Asp Phe His Phe Ser Ser Asp Arg Thr Ala Gly Asp 
115 120 125 

agg gag gtc cag cag gec agt etc atg ttc ttt gtg cag etc cct tec 553 
Arg Glu Val Gin Gin Ala Ser Leu Met Phe Phe Val Gin Leu Pro Ser 
"0 135 140 

aat acc act tgg acc ttg aaa gtg aga gtc ctt gtg ctg ggt cca cat 601 
Asn Thr Thr Trp Thr Leu Lys Val Arg Val Leu Val Leu Gly Pro His 
145 150 155 


aat acc aac etc acc ttg get act cag tac ctg ctg gag gtg gat gee 649 
Asn Thr Asn Leu Thr Leu Ala Thr Gin Tyr Leu Leu Glu Val Asp Ala 
15 160 165 170 

agt ggc tgg cat caa etc ccc eta ggg cct gaa get caa get gec tgc 697 
Ser Gly Trp His Gin Leu Pro Leu Gly Pro Glu Ala Gin Ala Ala Cys 
175 180 185 190 


age cag ggg cac ctg acc ctg gag ctg gta ctt gaa ggc cag gta gec 745 
Ser Gin Gly His Leu Thr Leu Glu Leu Val Leu Glu Gly Gin Val Ala 
195 200 205 

cag age tea gtc at e ctg got aga get gm r-nh ngj , , t UL 1 j Lj ^ jjj- 

Gln Ser Ser Val He Leu Gly Gly Ala Ala His Arg Pro Phe Val Ala 
210 215 220 

gec egg gtg aga gtt ggg ggc aaa cac cag att cac cga cga ggc ate 841 
Ala Arg Val Arg Val Gly Gly Lys His Gin He His Arg Arg Gly He 
225 230 235 


gac tgc caa gga ggg tec agg atg tgc tgt cga caa gag ttt ttt gtg 
Asp Cys Gin Gly Gly Ser Arg Met Cys Cys Arg Gin Glu Phe Phe Val 

35 240 24S 250 


889 


gac ttc cgt gag att ggc tgg cac gac tgg ate ate cag cct gag ggc 937 
Asp Phe Arg Glu He Gly Trp His Asp Trp He He Gin Pro Glu Gly 
25 5 260 265 2 70 


tac gec atg aac ttc tgc ata ggg cag tgc cca eta cac ata gca ggc- 
Tyr Ala Met Asn Phe Cys He Gly Gin Cys Pro Leu His He Ala Gly 
275 280 285 


985 


10 
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atg cct ggt att get gcc tec ttt cac act gca gtg etc aat ctt etc 1033 
Met Pro Gly He Ala Ala Ser Ph His Thr Ala Val Leu Asn Leu Leu 
290 295 300 

aag gcc aac aca get gca ggc acc act gga ggg gg C tea nnn tgt gta 1081 
Lys Ala Asn Thr Ala Ala Gly Thr Thr Gly Gly Gly Ser Xaa Cys Val 
305 310 3i5 

ccc acg gcc egg cgc ccc ctg tct ctg etc tat tat gac agg gac age 1129 
Pro Thr Ala Arg Arg Pro Leu Ser Leu Leu Tyr Tyr Asp Arg Asp Ser 
320 325 330 


aac att gtc aag act gac ata cct gac atg gta gta gag gcc tgt ggg 1177 
Asn He Val Lys Thr Asp He Pro Asp Met Val Val Glu Ala Cys Gly 
1 5 335 340 345 350 

tgc agt tagtctatgt gtggtatggg cagcccaagg ttgcatggga aaacacgccc 1233 
Cys Ser 

20 ctacagaagt gcacttcctt gagaggaggg aatgacctca ttctctgtcc agaatgtgga 1293 
ctccctcttc ctgagcatct tatggaaatt accccacctt tgacttgaag aaaccttcat 1353 
ctaaagcaag tcactgtgcc atcttcctga ccactaccct ctttcctagg gcatagtcca 1413 

25 

tcccgctagt ccatcccgct agccccactc cagggactca gacccatctc caaccatgag 1473 
caatgccatc tggttcccag gcaaagacac ccttagctca cctttaatag accccataac 1533 
30 ccactatgcc ttcctgtcct ttctactcaa tggtccccac tccaagatga gttgacacaa 1593 
ccccttcccc caatttttgt ggatctccag agaggecett ctttggattc accaaagttt 1653 
agatcactgc tgcccaaaat agaggcttac ctacccccct ctttgttgtg agcccctgtc 1713 

35 

cttcttagtt gtccaggtga actactaaag ctctctttgc ataccttcat ccattttttg 1773 
tccttctctg cctttctcta tgcccttaag gggtgacttg cctgagctct atcacctgag 1833 
40 ctcccctgcc ctctggcttc ctgctgaggt cagggcattt cttatccctg ttccctctct 1893 
gtctaggtgt catggttctg tgtaactgtg gctattctgt gtccctacac tacctggcta 1953 
cccccttcca tggccccagc tctgcctaca ttctgatttt tttttttttt ttttttttga 2013 
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35 


aaagttaaaa attccttaat tttttattcc tggtaccact accacaattt acagggcaat 2073 
atacctgatg taatgaaaag aaaaagaaaa agacaaagct acaacagata aaagacctca 2133 
5 ggaatgtaca tctaattgac actacattgc attaatcaat agctgcactt tttgcaaact 2X93 
gtggctatga cagtcctgaa caagaagggt ttcctgttta agctgcagta acttttctga 2253 

ctatggatca tcgttcctt 


10 


2272 


15 


20 


<210> 4 

<211> 352 

<212> PRT 

<213> Homo sapiens 

<400> 4 

Met Thr Ser Ser Leu Leu Leu Ala Phe Leu Leu Leu Ala Pro Thr Thr 
Val Ala Thr Pro Arg Ala Gly Gly Gin Cys Pro Ala Cys Gly Gly Pro 


20 


25 


30 


25 


Thr Leu Glu Leu Glu Ser Gin Arg Glu Leu Leu Leu Asp Leu Ala Lys 
35 40 


45 



Arrr p ™ t *t- T .r u A nn 
60 


30 Arg Pro Val Ser Arg Ala Ala Leu Arg Thr Ala Leu Gin His Leu His 


65 


70 


75 


80 


35 


40 


Gly Val Pro Gin Gly Ala Leu Leu Glu Asp Asn Arg Glu Gin Glu Cys 
85 50 95 

Glu lie lie Ser Phe Ala Glu Thr Gly Leu Ser Thr lie Asn Gin Thr 
100 110 

Arg Leu Asp Phe His Phe Ser Ser Asp Arg Thr Ala Gly Asp Arg Glu 


115 


120 


125 


Val Gin Gin Ala Ser Leu Met Phe Phe Val Gin Leu Pro Ser 


130 


Asn Thr 


135 


140 


45 
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Thr Trp Thr Leu Lys Val Arg Val Leu Val Leu Gly Pro His Asn Thr 
145 150 155 160 

Asn Leu Thr Leu Ala Thr Gin Tyr Leu Leu Glu Val Asp Ala Ser Gly 
5 165 170 175 

Trp His Gin Leu Pro Leu Gly Pro Glu Ala Gin Ala Ala Cys Ser Gin 
180 185 190 

10 Gly His Leu Thr Leu Glu Leu Val Leu Glu Gly Gin Val Ala Gin Ser 
195 200 205 


15 


Ser Val lie Leu Gly Gly Ala Ala His Arg Pro Phe Val Ala Ala Arg 
210 215 220 

Val Arg Val Gly Gly Lys His Gin He His Arg Arg Gly He Asp Cys 
225 230 235 240 


20 


Gin Gly Gly Ser Arg Met Cys Cys Arg Gin Glu Phe Phe Val Asp Phe 
245 250 255 


Arg Glu He Gly Trp His Asp Trp He He Gin Pro Glu Gly Tyr Ala 
260 265 270 

25 Met Asn Phe Cys He Gly Gin Cys Pro Leu His He Ala Gly Met Pro 
275 280 285 


30 


Gly He Ala Ala Ser Phe His Thr Ala Val Leu Asn Leu Leu Lys Ala 
290 295 300 

Asn Thr Ala Ala Gly Thr Thr Gly Gly Gly Ser Xaa Cys Val Pro Thr 
305 310 315 320 


35 


Ala Arg Arg Pro Leu Ser Leu Leu Tyr Tyr Asp Arg Asp Ser Asn He 
325 330 335 


Val Lys Thr Asp He Pro Asp Met Val val Glu Ala Cys Gly Cys Ser 
340 345 350 


<210> 5 

<211> 30 

<212> DNA 

45 <213> Artificial Sequence 
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<220> 

<223> Description of Artificial Sequence: mutagenic 
primer 

<400> 5 

ccacaccacc caccgcctgt gtgcccacgc 


<210> 6 
<211> 30 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: mutagenic 
primer 

<400> 6 

gcgtgggcac acaggcggtg ggtggtgtgg 


<210> 7 
<211> 30 

<21 2> DNA ■ ~ 

<213> Artificial Sequence 

<220> 

<223> Description of Artificial Sequence: mutagenic 
primer 

<400> 7 

ctggaggggg ctcagcctgt gtacccacgg 


<210> 8 
<211> 30 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: mutagenic 
primer 
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<400> 8 

ccgtgggtac acaggctgag ccccctccag 3Q 


5 
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Claims 


Protein selected from the members of the TGF-B superfamily, 
characterized in that the protein is necessarily monomeric due to 
substitution or deletion of a cysteine which is responsible for dimer 
formation. 


Protein according to claim 1 , 

characterized in that the protein is a mature protein or a biologically 
active part or variant thereof. 

Protein according to any one of the preceeding claims, 
characterized in that the protein contains at least the 7 cysteine 
region characteristic for the TGF-B protein superfamily. 

Protein according to claim 3, 

characterized in that it contains a consensus sequence according to 
Formula I: C{Y) 25 . 29 CYYYC{Y) 25 . 35 XC{Y) 27 . 34 CYC or 
Formula II: C(Y) 28 CYYYC(Y) 3D .3 2 XC(Y) 31 CYC > wherein C denotes 
cysteine, Y denotes any amino acid and X denotes any amino acid 
except cysteine. 

Protein according to any one of claims 1 to 4, 
characterized in that the protein is a morphogenetic protein. 


Protein according to any one of the preceeding claims, 
characterized in that the proteins belongs to the TGF-/?, BMP, GDF, 
activin or GDNF family. 


Protein according to claim 6, 

characterized in that the protein is MP52 (GDF5) or a biologically 
active part or variant thereof. 
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8. Protein according to any one of the preceeding claims, 

characterized in that it comprises the amino acid sequence according 
to SEQ.ID.NO.2 or a part thereof. 

s 9. Protein according to claim 6, 

characterized in that the protein is MP121 or a biologically active part 
or variant thereof. 

1 0. Protein according to claim 9, 

10 characterized in that it comprises the amino acid sequence according 

to SEQ.ID.N0.4 or a part thereof. 

1 1 . Protein according to any one of claims 1 to 1 0, 

characterized in that the cysteine residue is substituted by an amino 
15 acid selected from the group of alanine, serine, threonine, leucine, 

isoleucine, glycine and valine. 

1 2. Protein according to any one of claims 1 to 1 1 , 

characterized in that it contains additional amino acids that facilitate 
20 or mediate the transfer and localization of the protein in a certain 

tissue. 

1 3. Nucleic acid, 

characterized in that it encodes a protein according to any one of 
25 claims 1 to 12. 

14. Nucleic acid according to claim 13, 
characterized in that it is a DNA. 


30 15. 


Nucleic acid according to claim 1 3 or 1 4, 

characterized in that it contains a sequence as shown 

SEQ.ID.N0.1 or a fragment thereof. 
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1 6. Nucleic acid according to claim 13 or 14, 

characterized in that it contains a sequence as shown in 
SEQ.ID.N0.3 or a fragment thereof. 

5 17. Nucleic acid according to any one of claims 13 to 16, 

characterized in that it further contains suitable expression control 
sequences facilitating and/or driving expression of the encoded 
protein. 


10 18. Expression vector, 

characterized in that it contains a nucleic acid according to any one 
of claims 1 3 to 1 7 in a suitable vector system. 

19. Expression vector according to claim 18, 

15 characterized in that the vector system is suitable for prokaryotic 

expression. 

20. Host cell, 

cnaractenzed in that it contains a nucleic acid according to any one 
eo of claims 1 3 to 1 7 or an expression vector according to claims 1 8 or 

19 and upon expression of said nucleic acid or vector is able to 
produce a monomeric protein according to any one of claims 1 to 1 2. 

21. Host cell according to claim 20, 

s characterized in that it is a prokaryotic host cell. 

22. Host cell according to claim 20, 
characterized in that it is an embryonal cell. 


30 23. 


Pharmaceutical composition, 

characterized in that it contains at least one protein according to any 
one of claims 1 to 1 2 or at least one nucleic acid according to any 
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one of claims 13 to 17, at least one expression vector according to 
any one of claims 1 8 or 1 9 or at least one host cell according to 
claim 20 or 22. 

5 24. Pharmaceutical composition according to claim 23, 

characterized in that the protein and/or nucleic acid are contained in 
and/or on a biocompatible matrix material. 

25. Pharmaceutical composition according to claim 24, 

io characterized in that the matrix material is biodegradable. 

26. Pharmaceutical composition according to claims 24 or 25, 
characterized in that the matrix material is itself osteogenically active. 

15 27. Pharmaceutical compostion according to any one of claims 23 to 26, 
for the prevention or therapy of diseases for which also the dimeric 
form of the protein would be indicated. 

28. Pharmaceutical composition according to claim 27, 
20 for prevention or therapy of diseases associated with bone and/or 

cartilage damage or affecting bone and/or cartilage disease or 
situations in which cartilage and/or bone growth is desirable or for 
spinal fusion. 

25 29. Pharmaceutical composition according to claim 27, 

for prevention or therapy of damaged or diseased tissue associated 
with connective tissue including tendon and/or ligament, periodontal 
or dental tissue including dental implants, neural tissue including CNS 
tissue and neuropathological situations, tissue of the sensory system, 

30 liver, pancreas, cardiac, blood vessel, renal, uterine and thyroid 

tissue, skin, mucous membranes, endothelium, epithelium, for 
promotion or induction of nerve growth, tissue regeneration, 
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angiogenesis, wound healing including ulcers, burns, injuries or skin 
grafts, induction of proliferation of progenitor cells or bone marrow 
cells, for maintenance of a state of proliferation or differentiation, for 
treatment or preservation of tissue or cells for organ or tissue 
s transplantation, for integrity of gastrointestinal lining, for treatment 

of disturbances in fertility, contraception or pregnancy. 

30. Pharmaceutical composition according to any one of claims 23 to 29 
for surgical local application, topical or systemic application. 

> 

3 1 . Pharmaceutical compostion according to any one of claims 23 to 30 
characterized in that it further contains pharmacologically acceptable 
auxiliary substances. 


Pharmaceutical composition according to any one of claims 30 or 31 , 
characterized in that the composition is injectable. 

Pharmaceutical composition according to anyone of claims 30 to 32, 
- e * aracte «*^^ that allows to direct 

and release the composition to a determined^ite^racticTK 

34. Pharmaceutical composition according to claim 33, 

characterized in that the vehicle is selected from liposomes, 
nanospheres, larger implantable containers and micropumps. 


35. 


Use of a pharmaceutical composition according to any one of claims 
23 to 34 for the prevention or treatment of any indications of the 
dimeric form of the protein. 
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Name: MP52, dimeric form 


Formula 

Molecular weight 
Amino acid composition 
Disulfide bond 


26994 Dalton 
238 amino acids 
7 bonds 


Primary structure 



1/2 
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Fig. 1 B 
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SEQUENCE LISTING 

<110> HyGene AG 

<120> Monomeric Protein of the TGF-beta Family 

<130> 20780pwomd 

<140> 
<141> 

<150> EP99115613 
<151> 1999-08-06 

<160> 8 

<170> Patentln Ver. 2.1 

<210> 1 

<211> 2703 

<212> DMA 

<213> Homo sapiens 

<220> 
<221> CDS 

<222> (640) . . (2142) 
<400> 1 

ccatggcctc gaaagggcag cggtgatttt tttcacataa atatatcgca cttaaatgag 60 
tttagacagc atgacatcag agagtaatta aattggtttg ggttggaatt ccgtttccaa 120 
ttcctgagtt caggtttgta aaagattttt ctgagcacct gcaggcctgt gagtgtgtgt 180 
gtgtgtgtgt gtgtgtgtgt gtgtgtgtga agtattttca ctggaaagga ttcaaaacta 240 
gggggaaaaa aaaactggag cacacaggca gcattacgcc attcttcctt cttggaaaaa 300 
tccctcagcc ttatacaagc ctccttcaag ccctcagtca gttgtgcagg agaaaggggg 360 
cggttggctt tctcctttca agaacgagtt attttcagct gctgactgga gacggtgcac 420 
gtctggatac gagagcattt ccactatggg actggataca aacacacacc cggcagactt; 480 
caagagtctc agactgagga gaaagccttt ccttctgctg ctactgctgc tgccgctgct 540 
tttgaaagtc cactcctttc atggtttttc ctgccaaacc agaggcacct ttgctgctgc 600 
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cgctgttctc tttggtgtca ttcagcggct ggccagagg atg aga etc ccc aaa 654 

Met Arg Leu Pro Lys 
1 5 

etc etc act ttc ttg ctt tgg tac ctg get tgg ctg gac ctg gaa ttc 702 
Leu Leu Thr Phe Leu Leu Trp Tyr Leu Ala Trp Leu Asp Leu Glu Phe 
10 15 20 

ate tgc act gtg ttg ggt gec cct gac ttg ggc cag aga ccc cag ggg 750 
lie Cys Thr Val Leu Gly Ala Pro Asp Leu Gly Gin Arg Pro Gin Gly 
25 30 35 

acc agg cca gga ttg gec aaa gca gag gec aag gag agg ccc ccc ctg 798 
Thr Arg Pro Gly Leu Ala Lys Ala Glu Ala Lys Glu Arg Pro Pro Leu 
40 45 50 

gec egg aac gtc ttc agg cca ggg ggt cac age tat ggt ggg ggg gec 846 
Ala Arg Asn Val Phe Arg Pro Gly Gly His Ser Tyr Gly Gly Gly Ala 
55 60 65 

acc aat gee aat gee agg gca aag gga ggc acc ggg cag aca gga ggc 894 
Thr Asn Ala Asn Ala Arg Ala Lys Gly Gly Thr Gly Gin Thr Gly Gly 
70 75 80 85 

ctg aca cag ccc aag aag gat gaa ccc aaa aag ctg ccc ccc aga ccg 942 
Leu Thr Gin Pro Lys Lys Asp Glu Pro Lys Lys Leu Pro Pro Arg Pro 
90 95 100 


ggc ggc cct gaa ccc aag cca gga cac cct ccc caa aca agg cag get 990 
Gly Gly Pro Glu Pro Lys Pro Gly His Pro Pro Gin Thr Arg Gin Ala 
105 110 115 

aca gec egg act gtg acc cca aaa gga cag ctt ccc gga ggc aag gca 1038 
Thr Ala Arg Thr Val Thr Pro Lys Gly Gin Leu Pro Gly Gly Lys Ala 
120 125 130 

ccc cca aaa gca gga tct gtc ccc age tec ttc ctg ctg aag aag gee 1086 
Pro Pro Lys Ala Gly Ser Val Pro Ser Ser Phe Leu Leu Lys Lys Ala 
135 140 145 

agg gag ccc ggg ccc cca cga gag ccc aag gag ccg ttt cgc cca ccc 1134 
Arg Glu Pro Gly Pro Pro Arg Glu Pro Lys Glu Pro Phe Arg Pro Pro 
150 155 160 165 

ccc ate aca ccc cac gag tac atg etc teg ctg tac agg acg ctg tec 1182 
Pro lie Thr Pro His Glu Tyr Met Leu Ser Leu Tyr Arg Thr Leu Ser 
170 175 180 


2 
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gat get gac aga aag gga ggc aac age age gtg aag ttg gag get ggc 1230 
Asp Ala Asp Arg Lys Gly Gly Asn Ser Ser Val Lys Leu Glu Ala Gly 
185 190 195 

ctg gec aac ace ate ace age ttt att gac aaa ggg caa gat gac cga 1278 
Leu Ala Asn Thr He Thr Ser Phe He Asp Lys Gly Gin Asp Asp Arg 
200 205 210 

ggt ccc gtg gtc agg aag cag agg tac gtg ttt gac att agt gec ctg 1326 
Gly Pro Val Val Arg Lys Gin Arg Tyr Val Phe Asp He Ser Ala Leu 
215 220 225 

gag aag gat ggg ctg ctg ggg gec gag ctg egg ate ttg egg aag aag 1374 
Glu Lys Asp Gly Leu Leu Gly Ala Glu Leu Arg He Leu Arg Lys Lys 
230 235 240 245 

ccc teg gac acg gec aag cca gcg gec ccc gga ggc ggg egg get gec 1422 
Pro Ser Asp Thr Ala Lys Pro Ala Ala Pro Gly Gly Gly Arg Ala Ala 
250 255 260 

cag ctg aag ctg tec age tgc ccc age ggc egg cag ccg gec tec ttg 1470 
Gin Leu Lys Leu Ser Ser Cys Pro Ser Gly Arg Gin Pro Ala Ser Leu 
265 270 275 

ctg gat gtg cgc tec gtg cca ggc ctg gac gga tct ggc tgg gag gtg 1518 
Leu Asp Val Arg Ser Val Pro Gly Leu Asp Gly Ser Gly Trp Glu Val 
280 285 290 

ttc gac ate tgg aag etc ttc cga aac ttt aag aac teg gee cag ctg 1566 
Phe Asp He Trp Lys Leu Phe Arg Asn Phe Lys Asn Ser Ala Gin Leu 
295 300 305 

tgc ctg gag ctg gag gee tgg gaa egg ggc agg gec gtg gac etc cgt 1614 
Cys Leu Glu Leu Glu Ala Trp Glu Arg Gly Arg Ala Val Asp Leu Arg 
310 315 320 325 

ggc ctg ggc ttc gac cgc gee gec egg cag gtc cac gag aag gec ctg 1662 
Gly Leu Gly Phe Asp Arg Ala Ala Arg Gin Val His Glu Lys Ala Leu 
330 335 340 

ttc ctg gtg ttt ggc cgc ace aag aaa egg gac ctg ttc ttt aat gag 1710 
Phe Leu Val Phe Gly Arg Thr Lys Lys Arg Asp Leu Phe Phe Asn Glu 
345 350 355 

att aag gec cgc tct ggc cag gac gat aag ace gtg tat gag tac ctg 1758 
He Lys Ala Arg Ser Gly Gin Asp Asp Lys Thr Val Tyr Glu Tyr Leu 
360 365 370 
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ttc age cag egg cga aaa egg egg gee cca ctg gec act cgc cag ggc 1806 
Phe Ser Gin Arg Arg Lys Arg Arg Ala Pro Leu Ala Thr Arg Gin Gly 
375 380 385 


aag cga ccc age aag aac ctt aag get cgc tgc agt egg aag gca ctg 1854 
Lys Arg Pro Ser Lys Asn Leu Lys Ala Arg Cys Ser Arg Lys Ala Leu 
390 395 400 405 


cat gtc aac ttc aag gac atg ggc tgg gac gac tgg ate ate gca ccc 1902 
His Val Asn Phe Lys Asp Met Gly Trp Asp Asp Trp He He Ala Pro 
410 415 420 

ctt gag tac gag get ttc cac tgc gag ggg ctg tgc gag ttc cca ttg 1950 
Leu Glu Tyr Glu Ala Phe His Cys Glu Gly Leu Cys Glu Phe Pro Leu 
425 430 435 


cgc tec cac ctg gag ccc acg aat cat gca gtc ate cag acc ctg atg 1996 
Arg Ser His Leu Glu Pro Thr Asn His Ala Val He Gin Thr Leu Met 
440 445 450 


aac tec atg gac ccc gag tec aca cca ccc acc nnn tgt gtg ccc acg 204 6 
Asn Ser Met Asp Pro Glu Ser Thr Pro Pro Thr Xaa Cys Val Pro Thr 
455 460 465 


egg ctg agt ccc ate age ate etc ttc att gac tct gec aac aac gtg 2094 

Arg Leu Ser Pro He Ser He Leu Phe He Asp Ser Ala Asn Asn Val 

470 475 480 ifi5 


gtg tat aag cag tat gag gac atg gtc gtg gag teg tgt ggc tgc agg 2142 
Val Tyr Lys Gin Tyr Glu Asp Met Val Val Glu Ser Cys Gly Cys Arg 
490 495 500 


tagcagcact ggccctctgt cttcctgggt 
ctggaatcac agaggggtca ggaagctgtg 
gggattccaa taagcttget cgctctctga 
ccacaagttc ccctggctga ggattgctgc 
aggtccaggg agacagactc tgaatgggac 


ggcacatccc aagagcccct tcctgcactc 2202 
gcaggagcat ctacacagct tgggtgaaag 2262 
gtgtgacttg ggctaaaggc ccccttttat 2322 
ccgtctgctg atgtgaccag tggcaggcac 2382 
tgagtcccag gaaacagtgc tttccgatga 24 42 


gactcagccc accatttctc ctcacctggg 
ctctcaggag agecacaggt gccactgcct 
ctgtccctgg gacagttgag aagctgactg 


ccttctcagc ctctggactc tcctaagcac 2502 
cctcaaatca catttgtgcc tggtgacttc 2562 
ggcaagagtg ggagagaaga ggagagggct 2622 
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gactgttaga tttaaatgta tattgatgag 2682 

2703 


<210> 2 

<211> 501 

<212> PRT 

<213> Homo sapiens 


<400> 2 

Met Arg Leu Pro Lys Leu Leu Thr Phe Leu Leu Trp Tyr Leu Ala Trp 
1 5 10 15 

Leu Asp Leu Glu Phe lie Cys Thr Val Leu Gly Ala Pro Asp Leu Gly 
20 25 30 

Gin Arg Pro Gin Gly Thr Arg Pro Gly Leu Ala Lys Ala Glu Ala Lys 
35 40 45 

Glu Arg Pro Pro Leu Ala Arg Asn Val Phe Arg Pro Gly Gly His Ser 
50 55 60 

Tyr Gly Gly Gly Ala Thr Asn Ala Asn Ala Arg Ala Lys Gly Gly Thr 
65 70 75 80 

Gly Gin Thr Gly Gly Leu Thr Gin Pro Lys Lys Asp Glu Pro Lys Lys 
85 90 95 


Leu Pro Pro Arg Pro Gly Gly Pro Glu Pro Lys Pro Gly His Pro Pro 
100 105 110 


Gin Thr Arg Gin Ala Thr Ala Arg Thr Val Thr Pro Lys Gly Gin Leu 
115 120 125 

Pro Gly Gly Lys Ala Pro Pro Lys Ala Gly Ser Val Pro Ser Ser Phe 
130 135 140 


Leu Leu Lys Lys Ala Arg Glu Pro Gly Pro Pro Arg Glu Pro Lys Glu 
145 150 155 160 

Pro Phe Arg Pro Pro Pro He Thr Pro His Glu Tyr Met Leu Ser Leu 

165 170 175 


Tyr Arg Thr Leu Ser Asp Ala Asp Arg Lys Gly Gly Asn Ser Ser Val 
180 185 190 


Lys Leu Glu Ala Gly Leu Ala Asn Thr He Thr Ser Phe He Asp Lys 


5 


WO 01/11041 

195 


200 
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205 


Gly Gin Asp Asp Arg Gly Pro Val Val Arg Lys Gin Arg Tyr Val Phe 
210 215 220 

Asp lie Ser Ala Leu Glu Lys Asp Gly Leu Leu Gly Ala Glu Leu Arg 
225 230 235 240 

He Leu Arg Lys Lys Pro Ser Asp Thr Ala Lys Pro Ala Ala Pro Gly 
245 250 255 

Gly Gly Arg Ala Ala Gin Leu Lys Leu Ser Ser Cys Pro Ser Gly Arg 
260 265 270 

Gin Pro Ala Ser Leu Leu Asp Val Arg Ser Val Pro Gly Leu Asp Gly 
275 280 285 

Ser Gly Trp Glu Val Phe Asp He Trp Lys Leu Phe Arg Asn Phe Lys 
290 295 300 

Asn Ser Ala Gin Leu Cys Leu Glu Leu Glu Ala Trp Glu Arg Gly Arg 
305 310 315 320 

Ala Val Asp Leu Arg Gly Leu Gly Phe Asp Arg Ala Ala Arg Gin Val 
325 330 335 

His Glu Lys Ala Leu Phe Leu Val Phe Gly Arg Thr Lys Lys Arg Asp 
340 345 350- 


Leu Phe Phe Asn Glu He Lys Ala Arg Ser Gly Gin Asp Asp Lys Thr 
355 360 365 

Val Tyr Glu Tyr Leu Phe Ser Gin Arg Arg Lys Arg Arg Ala Pro Leu 
370 375 380 

Ala Thr Arg Gin Gly Lys Arg Pro Ser Lys Asn Leu Lys Ala Arg Cys 
385 390 395 400 

Ser Arg Lys Ala Leu His Val Asn Phe Lys Asp Met Gly Trp Asp Asp 
405 410 415 

Trp He He Ala Pro Leu Glu Tyr Glu Ala Phe His Cys Glu Gly Leu 
420 425 430 

Cys Glu Phe Pro Leu Arg Ser His Leu Glu Pro Thr Asn His Ala Val 
435 440 445 


lie Gin Thr Leu Met Asn Ser Met Asp Pro Glu Ser Thr Pro Pro Thr 
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. 450 455 460 

Xaa Cys Val Pro Thr Arg Leu Ser Pro He Ser He Leu Phe He Asp 
465 470 475 480 

Ser Ala Asn Asn Val Val Tyr Lys Gin Tyr Glu Asp Met Val Val Glu 
485 490 495 

Ser Cys Gly Cys Arg 
500 


<210> 3 

<211> 2272 

<212> DNA 

<213> Homo sapiens 

<220> 
<221> CDS 

<222> (128) (1183) 
<400> 3 

caaggagcca tgccagctgg acacacactt cttccagggc ctctggcagc caggacagag 60 

ttgagaccac agctgttgag accctgagcc ctgagtctgt attgctcaag aagggccttc 120 

cccagca atg acc tec tea ttg ctt ctg gee ttr etc etc ctg get cca 169 
Met Thr Ser Ser Leu Leu Leu Ala. Phe Leu Leu Leu Ala Pro 
1 5 10 

acc aca gtg gec act ccc aga get ggc ggt cag tgt cca gca tgt ggg 217 
Thr Thr Val Ala Thr Pro Arg Ala Gly Gly Gin Cys Pro Ala Cys Gly 
15 20 25 30 

ggg ccc acc ttg gaa ctg gag age cag egg gag ctg ctt ctt gat ctg 265 
Gly Pro Thr Leu Glu Leu Glu Ser Gin Arg Glu Leu Leu Leu Asp Leu 
35 40 45 

gec aag aga age ate ttg gac aag ctg cac etc acc cag cgc cca aca 313 
Ala Lys Arg Ser He Leu Asp Lys Leu His Leu Thr. Gin Arg Pro Thr 
50 55 60 

ctg aac cgc cct gtg tec aga get get ttg agg act gca ctg cag cac 361 
Leu Asn Arg Pro Val Ser Arg Ala Ala Leu Arg Thr Ala Leu Gin His 
65 70 75 

etc cac ggg gtc cca cag ggg gca ctt eta gag gac aac agg gaa cag 409 
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Leu His Gly Val Pro Gin Gly Ala Leu Leu Glu Asp Asn Arg Glu Gin 
80 85 90 

gaa tgt gaa ate ate age ttt get gag aca ggc etc tec acc ate aac 457 
Glu Cys Glu lie lie Ser Phe Ala Glu Thr Gly Leu Ser Thr lie Asn 
95 100 105 110 

cag act cgt ctt gat ttt cac ttc tec tct gat aga act get ggt gac 505 
Gin Thr Arg Leu Asp Phe His Phe Ser Ser Asp Arg Thr Ala Gly Asp 
115 120 125 

agg gag gtc cag cag gec agt etc atg ttc ttt gtg cag etc cct tec 553 
Arg Glu Val Gin Gin Ala Ser Leu Met Phe Phe Val Gin Leu Pro Ser 
130 135 140 

aat acc act tgg acc ttg aaa gtg aga gtc ctt gtg ctg ggt cca cat 601 
Asn Thr Thr Trp Thr Leu Lys Val Arg Val Leu Val Leu Gly Pro His 
145 150 155 

aat acc aac etc acc ttg get act cag tac ctg ctg gag gtg gat gee 64 9 
Asn Thr Asn Leu Thr Leu Ala Thr Gin Tyr Leu Leu Glu Val Asp Ala 
160 165 170 

agt ggc tgg cat caa etc ccc eta ggg cct gaa get caa get gee tgc 697 
Ser Gly Trp His Gin Leu Pro Leu Gly Pro Glu Ala Gin Ala Ala Cys 
175 180 185 190 


age cag ggg cac ctg acc ctg gag ctg gta ctt gaa ggc cag gta gee 745 

ser uxn Gly Hxs Leu Thr Leu Glu Leu Val Lau Glu Gly Glu Val Ala 

195 200 205 

cag age tea gtc ate ctg ggt gga get gec cat agg cct ttt gtg gca 793 
Gin Ser Ser Val lie Leu Gly Gly Ala Ala His Arg Pro Phe Val Ala 
210 215 220 

gec egg gtg aga gtt ggg ggc aaa cac cag att cac cga cga ggc ate 841 
Ala Arg Val Arg Val Gly Gly Lys His Gin He His Arg Arg Gly He 
225 230 235 

gac tgc caa gga ggg tec agg atg tgc tgt cga caa gag ttt ttt gtg 889 
Asp Cys Gin Gly Gly Ser Arg Met Cys Cys Arg Gin Glu Phe Phe Val 
240 245 250 

gac ttc cgt gag att ggc tgg cac gac tgg ate ate cag cct gag ggc 937 
Asp Phe Arg Glu He Gly Trp His Asp Trp He He Gin Pro Glu Gly 
255 260 265 270 

tac gee atg aac ttc tgc aza ggg cag tgc cca eta cac ata gca ggc 985 
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Tyr Ala Met Asn Phe Cys II Gly Gin Cys Pro Leu His He Ala Gly 
275 280 285 

atg cct ggt att get gec tec ttt cac act gca gtg etc aat ctt etc 1033 
Met Pro Gly He Ala Ala Ser Phe His Thr Ala Val Leu Asn Leu Leu 
290 295 300 

aag gee aac aca get gca ggc acc act gga ggg ggc tea nnn tgt gta 1081 
Lys Ala Asn Thr Ala Ala Gly Thr Thr Gly Gly Gly Ser Xaa Cys Val 
305 310 315 

ccc acg gec egg cgc ccc ctg tct ctg etc tat tat gac agg gac age 1129 
Pro Thr Ala Arg Arg Pro Leu Ser Leu Leu Tyr Tyr Asp Arg Asp Ser 
320 325 330 

aac att gtc aag act gac ata cct gac atg gta gta gag gec tgt ggg 1177 
Asn He Val Lys Thr Asp He Pro Asp Met Val Val Glu Ala Cys Gly 
335 340 345 350 

tgc agt tagtctatgt gtggtatggg cagcccaagg ttgcatggga aaacacgccc 1233 
Cys Ser 

ctacagaagt gcacttcctt gagaggaggg aatgacctca ttctctgtcc agaatgtgga 1293 
ctccctcttc ctgagcatct tatggaaatt accccacctt tgacttgaag aaaccttcat 1353 
ctaaagcaag tcactgtgcc atcttcctga ccactaccct ctttcctagg gcatagtcca 1413 
tcccgctagt ccatcccgct agccccactc cagggactca gacccatctc caaccatgag 1473 
caatgccatc tggttcccag gcaaagacac ccttagctca cctttaatag accccataac 1533 
ccactatgcc ttcctgtcct ttctactcaa tggtccccac tccaagatga gttgacacaa 1593 
ccccttcccc caatttttgt ggatctccag agaggecett ctttggattc accaaagttt 1653 
agatcactgc tgcccaaaat agaggcttac ctacccccct ctttgttgtg agcccctgtc 1713 
cttcttagtt gtccaggtga actactaaag ctctctttgc ataccttcat ccattttttg 1773 
tccttctctg cctttctcta tgcccttaag gggtgacttg cctgagctct atcacctgag 1833 
ctcccctgcc ctctggcttc ctgctgaggt cagggcattt cttatccctg ttccctctct 1893 
gtctaggtgt catggttctg tgtaactgtg gctattctgt gtccctacac tacctggcta 1953 
cccccttcca tggccccagc tctgcctaca ttctgatttt tttttttttt ttttttttga 2013 
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aaagttaaaa attccttaat tttttattcc tggtaccact accacaattt acagggcaat 2073 

atacctgatg taatgaaaag aaaaagaaaa agacaaagct acaacagata aaagacctca 2133 

ggaatgtaca tctaattgac actacattgc attaatcaat agctgcactt tttgcaaact 2193 

gtggctatga cagtcctgaa caagaagggt ttcctgttta agctgcagta acttttctga 2253 
cratggatca tcgttcctt 2272 


<210> 4 • 
<211> 352 
<212> PRT 
<213> Homo sapiens 

<400> 4 

Met Thr Ser Ser Leu Leu Leu Ala Phe Leu Leu Leu Ala Pro Thr Thr 
1 5 10 15 

Val Ala Thr Pro Arg Ala Gly Gly Gin Cys Pro Ala Cys Gly Gly Pro 
20 25 30 

Thr Leu Glu Leu Glu Ser Gin Arg Glu Leu Leu Leu Asp Leu Ala Lys 
35 40 45 


Arg Ser lie Leu Asp Lys Leu His L eu Thr Gin Arg Pro Thr Leu Asn 
5U 55~ 60 


Arg Pro Val Ser Arg Ala Ala Leu Arg Thr Ala Leu Gin His Leu His 
65 70 75 80 

Gly Val Pro Gin Gly Ala Leu Leu Glu Asp Asn Arg Glu Gin Glu Cys 
85 90 95 

Glu He He Ser Phe Ala Glu Thr Gly Leu Ser Thr He Asn Gin Thr 
100 105 110 

Arg Leu Asp Phe His Phe Ser Ser Asp Arg Thr Ala Gly Asp Arg Glu 
115 120 125 

Val Gin Gin Ala Ser Leu Met Phe Phe Val Gin Leu Pro Ser Asn Thr 
130 135 140 

Thr Trp Thr Leu Lys Val Arg Val Leu Val Leu Gly Pro His Asn Thr 
145 150 155 160 

Asn Leu Thr Leu Ala Thr Gin Tyr Leu Leu Glu Val Asp Ala Ser Gly 
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165 170 175 

Trp His Gin Leu Pro Leu Gly Pro Glu Ala Gin Ala Ala Cys Ser Gin 
180 185 190 

Gly His Leu Thr Leu Glu Leu Val Leu Glu Gly Gin Val Ala Gin Ser 
195 200 205 

Ser Val He Leu Gly Gly Ala Ala His Arg Pro Phe Val Ala Ala Arg 
210 215 220 

Val Arg Val Gly Gly Lys His Gin He His Arg Arg Gly He Asp Cys 
225 230 235 240 

Gin Gly Gly Ser Arg Met Cys Cys Arg Gin Glu Phe Phe Val Asp Phe 
245 250 255 

Arg Glu He Gly Trp His Asp Trp He He Gin Pro Glu Gly Tyr Ala 
260 265 270 I 

Met Asn Phe Cys He Gly Gin Cys Pro Leu His He Ala Gly Met Pro 
275 280 285 

Gly He Ala Ala Ser Phe His Thr Ala Val Leu Asn Leu Leu Lys Ala 
290 295 300 

Asn Thr Ala Ala Gly Thr Thr Gly Gly Gly Ser Xaa Cys Val Pro Thr 
305 310 315 320 

Ala Arg Arg Pro Leu Ser Leu Leu Tyr Tyr Asp Arg Asp Ser Asn He 
325 330 335 

Val Lys Thr Asp He Pro Asp Met Val Val Glu Ala Cys Gly Cys Ser 
340 345 350 


<210> 5 
<211> 30 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: mutagenic 
primer 

<400> 5 

ccacaccacc caccgcctgt gtgcccacgc 30 
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<210> 6 
<211> 30 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: mutagenic 
primer 

• <400> 6 
gcgtgggcac acaggcggtg ggtggtgtgg 


<210> 7 
<211> 30 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: mutagenic 
primer 

<400> 7 

ctggaggggg ctcagcctgt gtacccacgg 


<210> 8 
<211> 30 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: mutagenic 
primer 

<400> 8 

ccgtgggtac acaggctgag ccccctccag 
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